Visualization of flow trace operation across multiple sites

ABSTRACT

Some embodiments provide a method for presenting a visualization of a data message flow within a logical network that is implemented across multiple sites. The method receives flow tracing data regarding the data message flow from a source endpoint in a first site to a second endpoint in a second site. The data message flow is processed according to logical forwarding elements (LFEs) implemented in at least the first and second sites. For each of the sites through which the data message flow passes, the method identifies the LFEs that process the data message flow in the site. The method presents a visualization for the data message flow. The visualization includes a separate section for each site through which the data message flow passes. Each section indicates at least a subset of the LFEs that process the data message flow in the corresponding site for the section.

BACKGROUND

As more networks move to the cloud, it is more common for one corporation or other entity to have networks spanning multiple sites. Much like with single-site logical networks, administrators may want to troubleshoot these networks using flow tracing operations. As networks that span multiple sites may be more complicated, there are additional difficulties in properly visualizing these flow tracing operations across multiple sites.

BRIEF SUMMARY

Some embodiments of the invention provide a method for presenting a visualization of a flow tracing operation within a logical network that is implemented across multiple sites (e.g., multiple federated datacenters in different locations). The method receives flow tracing data regarding a data message flow that is sent from a source endpoint in a first one of the sites to a destination endpoint in a second one of the sites (possibly flowing through other sites as well) and that is processed according to logical forwarding elements (LFEs) implemented in at least the first and second sites. The method identifies the LFEs that process the data message flow in each site through which the data message flow passes and presents a visualization for the data message flow that includes a separate section for each of these sites. Each of these sections indicates the LFEs that process the data message flow in the corresponding site.

In some embodiments, the method is performed by a network management and control system that receives flow tracing data from physical forwarding elements (PFEs) that implement the LFEs that process the data message flow at each of the sites through which the data message flow passes. These PFEs (e.g., software and/or hardware forwarding elements) are configured by the network management and control system to send flow tracing data to a central controller or controller cluster at their respective site (e.g., directly, or via local controllers that execute on host computers alongside the PFEs). The network management and control system is also responsible for injecting flow tracing data messages at the source endpoint, which are marked (e.g., with a flag bit) in some embodiments so that (i) the PFEs that process the marked data messages know to provide flow tracing data to the network management and control system and (ii) the PFE at the destination endpoint does not actually deliver the data message to this endpoint.

The PFEs, in some embodiments, send the flow tracing data to the controllers as control messages that indicate that the forwarding element has performed a specific action or actions, such as physical receipt of a data message at a particular port, ingress of a packet to an LFE, logical forwarding of a data message according to an LFE, application of a firewall, access control, or other rule for an LFE to a data message, physical forwarding (e.g., encapsulation and output) of a data message, dropping a data message, delivery of a data message to its destination, etc. The controllers and/or network managers at each site aggregate the flow tracing data for a particular flow tracing operation and pass this data to a centralized controller and/or manager cluster in some embodiments. The centralized controller and/or manager cluster is thus able to use the data from all of the sites to generate the visualization of the processing of the data message flow across the multiple sites.

The visualization of the flow tracing operation, as mentioned, displays a separate section for each site through which the data message flow passes, as well as at least a subset of the LFEs that process the data message flow in each site. In some embodiments, these sections are presented as vertical segments of the visualization display, with the section corresponding to the source site (i.e., the site at which the source endpoint is located) on one side and the section corresponding to the destination site (i.e., the site at which the destination endpoint is located) on the other side of the display. Some embodiments also use arrows to indicate the direction of the flow, which can be useful when a data message flow is sent from a first site to one or more additional sites and then back to the first site. In this case, different embodiments either use a strictly unidirectional (e.g., left-to-right) orientation and di splay multiple sections corresponding to the first site or have the data message flow visualization indicate movement from the first section to the other sections and then back to the first section.

The LFEs that process the data message and are displayed within the sections of the visualization may include at least logical routers and logical switches. Logical switches that process a data message, in some embodiments, can include user-configured logical switches to which the network endpoints connect as well as auto-configured logical switches that are used to connect gateways of the same logical router across multiple sites (also referred to as backplane logical switches). When a logical switch (of any type) spans multiple sites, some embodiments use tunnels between logical network gateways (also referred to as remote tunnel endpoints, or RTEPs) at each site that handle communication across sites. When a data message is sent from a first PFE executing on a host computer in one site to a second PFE executing on a host computer in another site (based on logical switch processing performed by the first PFE), the first PFE transmits the data message to a first logical network gateway associated with the logical switch in the first site, which transmits the data message to a second logical network gateway associated with the logical switch in the second site via the RTEP channel, which then transmits the data message to the second PFE. Some embodiments display the logical network gateways in the flow tracing operation (e.g., as a single GUI item on the boundary between the two sites) in addition to the user-configured logical switches.

Logical routers, in some embodiments, may include tier-0 logical routers (which connect directly to external networks, such as the Internet) and tier-1 logical routers (which may be interposed between logical switches and tier-0 logical routers). Tier-1 logical routers, in some embodiments, may have only a distributed routing component (DR), or may have both distributed routing components as well as centralized routing components (also referred to as service routing components, or SRs). SRs, for tier-1 routers, allow for centralized (e.g., stateful) services to be performed on data messages sent between DCNs connected to logical switches that connect to the tier-1 logical router and either DCNs connected to other logical switches that do not connect to the tier-1 logical router or external network endpoints. Tier-1 logical routers may be connected to tier-0 logical routers in some embodiments which, as mentioned, handle data messages exchanged between the logical network DCNs and external network endpoints. These tier-0 logical routers may also have a DR as well as one or more SRs (e.g., SRs at each site spanned by the TO logical router).

When a logical router has SRs in multiple sites, a separate SR is implemented on a separate host computer in each of the sites. Some embodiments designate one SR (i.e., one site) as the primary SR for each such logical router. In this case, many data messages sent from one site to another that require processing by the SR for a particular logical router are transmitted from their originating site to the site of the primary SR via the logical network gateways for a backplane logical switch that connects all of these SRs. In this case, the visualization of some embodiments displays a representation of the logical network gateways (i.e., of the RTEP channel) between the sections corresponding to these sites, in addition to showing the secondary and primary SRs in each of the sites that process the data message flow.

For example, a tracing operation data message flow sent from a source endpoint in a first site to a destination endpoint in a second site might be processed by a first logical switch to which the source endpoint connects, the DR of a first tier-1 logical router to which the first logical switch connects, the secondary SR of the first tier-1 logical router in the first site, the primary SR of the first tier-1 logical router in a third site (as well as the logical network gateways between these two SRs), the secondary SR of a tier-0 logical router in the third site, the primary SR of the tier-0 logical router in a fourth site (as well as the logical network gateways between these two SRs), the secondary SR of a second tier-1 logical router in the fourth site, the primary SR of the second tier-1 logical router in a fifth site (as well as the logical network gateways between these two SRs), the secondary SR of a second tier-1 logical router in the fifth site, the primary SR of the second tier-1 logical router in the second site (as well as the logical network gateways between these two SRs), the DR of the second tier-1 logical router in the second site, and a second logical switch to which the destination endpoint connects. Some embodiments display each logical switch, each set of logical network gateway (i.e., each RTEP channel between these logical network gateways), and each SR (both primary and secondary SRs) in the flow tracing visualization. It should be noted that many networks are configured such that the primary SRs for multiple logical routers are in the same site, so the data message flow does not have to pass through as many sites as in the above example. Other examples where the data message flow leaves one site and then comes back to that site are also possible, as described.

In some embodiments, each of the PFEs that implements the LFEs executes on a host computer at one of the sites. Specifically, in some embodiments the logical network gateways and SRs are implemented on edge devices (i.e., computing devices that handle the connections with external networks and between sites). These edge devices implement the distributed LFEs such as logical switches and DRs as well, which are also implemented by PFEs executing on the host computers. In some embodiments, the visualization provides an option for a user to view the physical computing device (e.g., edge devices, host computer) that implements each of the displayed LFEs that processes the data message flow.

When a data message flow passes through numerous sites, the visualization of the flow tracing operation can become crowded and difficult to follow because so many sections are required. In addition, an administrator may not be interested in viewing all of the different secondary and primary SRs that process the data message flow. As such, some embodiments present the sections for intermediate sites (i.e., any site that is not the first or last site through which a data message flow passes) as collapsible within the user interface. When the number of sites is above a threshold, some embodiments automatically display these sections as collapsed while providing a selectable item for expanding the section. Other embodiments do not automatically collapse any of the sections, but provide selectable items enabling collapse of each section. A collapsed section does not display any of the LFEs that process the data message flow at the corresponding site and is replaced in the visualization with the selectable expansion item. When expanded, a section displays its LFEs as normal. In addition, logical network gateways between collapsed and expanded sections are removed from the visualization in some embodiments.

The visualization of some embodiments can include additional details about the data message flow and the LFEs and physical elements that process the data message flow. In some embodiments, for example, a selectable option is provided to cause the visualization to display a number indicating the latency between sites. Additionally, or alternatively, the visualization provides a selectable option for displaying any alarms raised in the network. When selected, the visualization displays (e.g., by querying the network management system data store) indicators for any LFEs (or PFEs) on which alarms have been raised (e.g., by presenting the GUI items for these LFEs in a different color than the other LFEs).

The preceding Summary is intended to serve as a brief introduction to some embodiments of the invention. It is not meant to be an introduction or overview of all inventive subject matter disclosed in this document. The Detailed Description that follows and the Drawings that are referred to in the Detailed Description will further describe the embodiments described in the Summary as well as other embodiments. Accordingly, to understand all the embodiments described by this document, a full review of the Summary, Detailed Description and the Drawings is needed. Moreover, the claimed subject matters are not to be limited by the illustrative details in the Summary, Detailed Description, and the Drawings, but rather are to be defined by the appended claims, because the claimed subject matters can be embodied in other specific forms without departing from the spirit of the subject matters.

BRIEF DESCRIPTION OF THE DRAWINGS

The novel features of the invention are set forth in the appended claims. However, for purpose of explanation, several embodiments of the invention are set forth in the following figures.

FIG. 1 illustrates a graphical user interface that includes such a visualization of a flow tracing operation according to some embodiments.

FIG. 2 conceptually illustrates a process of some embodiments for performing a flow tracing operation and generating a visualization of that operation to present to a user.

FIG. 3 shows a visualization 300 for a flow tracing operation in which data messages are sent from a first site to a second site and back to the first site in order to exit the network.

FIG. 4 illustrates a visualization of a flow tracing operation for a data message flow in which the source and destination endpoints connect to the same logical switch at different sites.

FIG. 5 illustrates a flow tracing operation visualization with an option selected to view the physical computing devices implementing each displayed LFE in the course of processing the flow tracing operation data messages.

FIG. 6 illustrates a visualization of a flow tracing operation for a data message flow that passes through five different sites and thus includes five sections.

FIG. 7 illustrates a visualization of a flow tracing operation for a data message flow that passes through five different sites (similar to that shown in FIG. 6 ) but with the intermediate sections collapsed.

FIG. 8 illustrates the flow tracing operation visualization of FIG. 7 after one of the sections has been expanded.

FIG. 9 illustrates a visualization of a flow tracing operation in which a user has selected options for the visualization to include (i) information indicating the latency between datacenters and (ii) indications as to which LFEs currently have alarms raised.

FIG. 10 conceptually illustrates an electronic system with which some embodiments of the invention are implemented.

DETAILED DESCRIPTION

In the following detailed description of the invention, numerous details, examples, and embodiments of the invention are set forth and described. However, it will be clear and apparent to one skilled in the art that the invention is not limited to the embodiments set forth and that the invention may be practiced without some of the specific details and examples discussed.

Some embodiments of the invention provide a method for presenting a visualization of a data message flow within a logical network that is implemented across multiple sites (e.g., multiple federated datacenters in different locations). The method receives flow tracing data regarding a data message flow that is sent from a source endpoint in a first one of the sites to a destination endpoint in a second one of the sites (possibly flowing through other sites as well) and that is processed according to logical forwarding elements (LFEs) implemented in at least the first and second sites. The method identifies the LFEs that process the data message flow in each site through which the data message flow passes and presents a visualization for the data message flow that includes a separate section for each of these sites. Each of these sections indicates the LFEs that process the data message flow in the corresponding site.

FIG. 1 illustrates a graphical user interface that includes such a visualization 100 of a flow tracing operation according to some embodiments. In this example, the data message flow that is simulated by the flow tracing operation has a source endpoint as a virtual machine (VM) connected to a logical switch (represented as a segment) in a Paris datacenter, passes through a New York datacenter, and has a destination endpoint that is a VM connected to a logical switch in a London datacenter. Each of these sites (Paris, New York, and London) are given corresponding vertical sections 105-115 in the visualization 100, with the sections separated by dashed vertical lines.

Within each of the sections 105-115, the visualization 100 displays UI items representing LFEs that process the data message flow within the corresponding site. As shown, the data message flow (i.e., data messages belonging to the data message flow) is sent from the source VM (named “src_vm”) and is initially processed by a logical switch (“paris_seg1”). This processing typically occurs at a physical forwarding element (PFE) set (e.g., a set of virtual switches and/or virtual routers), executing on the same host computer as the source VM, and identifies that the packet is logically forwarded to a tier-1 (T1) logical router (“finance_t1”). The PFE set at the host also implements a distributed router (DR) of the T1 logical router, which in this case routes the data message flow to a centralized component of the T1 logical router implemented on an edge computing device in the same site. As described further below, in some embodiments when a logical router spans multiple sites, these centralized router (also called service routers, or SRs) are implemented in each of the spanned sites and one of the sites (and therefore one of the SRs) is designated as a primary SR, with all of the others designated as secondary SRs.

The primary SR for the T1 logical router “finance_t1” is located in the New York site, so the data message flow is routed to this primary SR via a tunnel between logical network gateways (also referred to as remote tunnel endpoints, or RTEPs). The visualization 100 displays indicators for the secondary SR in the Paris section 105, the primary SR in the New York section 110, and for the RTEP channel on the boundary between these two sections. These RTEP channels are shown using a single indicator in some embodiments, though they are each implemented as separate logical network gateways in each site (usually on the same edge device as the SRs that they connect).

Within the New York site, the primary T1 SR routes the data message flow to a tier-0 (T0) logical router (“corp_t0”). As with the T1 SR, the T0 SR in some embodiments includes a DR and a set of SRs (with a primary SR in one site and secondary SRs in other sites). Here, the primary SR of the TO logical router is located at the London site, so the data message flow is routed from the secondary T0 SR in the New York site to the primary TO in the London site, again via a tunnel between logical network gateways. The visualization 100 again includes indicators for the secondary T0 SR in the New York section 110, for the primary T0 SR in the London section 115, and for the RTEP which is shown on the boundary between these two sections.

The TO router routes the data message flow to a different T1 logical router (“it_t1”), which also has its primary SR located at the London site. An indicator for the primary SR of this T1 is shown within the London section 100 of the visualization 100. This T1 logical router routes the data message flow to a logical switch (“london_seg1”) at the same London site, so this switch is shown in the visualization 100 as well. Finally, the logical switch forwards the data message flow to the destination VM (“dst_vm”), which executes on a host computer also located at the London site. It should be noted that the flow tracing operation data messages would not actually be delivered to the destination VM by the PFE set at this host computer (because they are flow tracing operation messages that do not need to be processed by the VM). Further description of the processing of data messages for logical networks that span multiple sites can be found in U.S. patent application Ser. No. 16/906,891, filed Jun. 19, 2020, which is incorporated herein by reference.

In the example visualization 100, each of the LFEs shown is indicated by a shape (e.g., circle for logical routers and logical network gateways, rectangles for the logical switches) with lines connecting the segments and the order of processing moving from left to right as well as bottom to top and then back to bottom. It should be understood that this example as well as the various other example visualizations presented in this document are merely examples and that different embodiments may include different variations on these visualizations (e.g., different shapes to represent LFEs, description text inside the UI item representing an LFE rather than outside the UI item, different colors for different types of LFEs, etc.).

FIG. 2 conceptually illustrates a process 200 of some embodiments for performing a flow tracing operation and generating a visualization of that operation to present to a user. In some embodiments, the method is performed by a network manager, network controller, or other component of a network management and control system. In different embodiments, a single network manager, a cluster of network managers, or other set of components may perform the operations of the process 200. For instance, in some embodiments a network manager initiates the flow tracing operation while a separate visualization component of the network management and control system performs other operations to generate a visualization of the flow tracing operation.

As shown, the process 200 begins by initiating (at 205) a flow tracing operation through a logical network from a source endpoint to a destination endpoint. In some embodiments, a user of the network management and control system specifies (e.g., through a graphical user interface) for the flow tracing operation to be performed and specifies source and destination endpoints for the operation. In some embodiments, the source and destination endpoints are data compute nodes (e.g., VMs, containers, bare metal computing devices, etc.) in the logical network managed by the network management and control system. These source and destination endpoints may be located at the same site or different sites when the logical network spans multiple sites.

The network management and control system is also responsible for injecting flow tracing data messages at the source endpoint which are addressed to the destination endpoint as though the source endpoint had actually sent the data messages. These data messages are also marked with a flag (e.g., a single bit) that indicates that they are generated for the flow tracing operation. In some embodiments, this flag causes (i) PFEs that process the data messages to provide flow tracing data to the network management and control system and (ii) the PFE connected to the destination endpoint to process the data message but not actually deliver the data message to the destination endpoint. The flow tracing operations of some embodiments are described in greater detail in U.S. Pat. Nos. 9,197,529 and 10,805,239, both of which are incorporated herein by reference.

As a result of initiating the flow tracing operation, the process 200 receives (at 210) flow tracing data for the data message flow from PFEs that process the flow tracing operation data messages. These PFEs (e.g., software and/or hardware forwarding elements, such as edge devices as well as virtual switches and virtual routers executing on host computers) are configured by the network management and control system to send flow tracing data to a central controller or controller cluster at their respective site (e.g., directly, or via local controllers that execute on host computers alongside the PFEs). The flag bit used to mark the flow tracing operation data messages specifies for the PFEs to send flow tracing data in response to processing these data messages.

The PFEs, in some embodiments, send the flow tracing data to the controllers as control messages that indicate that the forwarding element has performed a specific action or actions. These actions can include physical receipt of a data message at a particular port, ingress of a packet to an LFE, logical forwarding of a data message according to an LFE, application of a firewall, access control, or other rule for an LFE to a data message, physical forwarding (e.g., encapsulation and output) of a data message, dropping a data message, delivery of a data message to its destination, etc. The controllers and/or network managers at each site aggregate the flow tracing data for a particular flow tracing operation and pass this data to a centralized controller and/or manager cluster in some embodiments (e.g., the network manager cluster that generates the visualization of the flow tracing operation). The centralized controller and/or manager cluster is thus able to use the data from all of the sites to generate the visualization of the processing of the data message flow across the multiple sites. Description of a multi-site network management and control system can be found in U.S. patent application Ser. No. 16/906,925, filed Jun. 19, 2020, which is incorporated herein by reference.

The process 200 then identifies (at 215) the sites from which flow tracing data was received. As mentioned, in some embodiments, the network managers and/or controllers at the sites provide flow tracing data to a centralized component of the network management and control system that generates a visualization of the flow tracing operation to provide to the user that requested the operation. Based on which sites provide flow tracing data, this central component can determine the sites that will be included in the visualization.

For each of these identified sites, the process 200 identifies (at 220) the LFEs that process the data messages at the site and the PFEs that implement these LFEs. In some embodiments, when a PFE provides flow tracing data for a particular operation, that operation specifies at least (i) the PFE and (ii) the LFE associated with the operation, if one exists (e.g., for a logical forwarding operation). Thus, when a PFE executing on the source host computer (i.e., the host computer at which the source endpoint executes) implements a logical switch by logically forwarding a data message to a logical router, that information is provided in a flow tracing control message. When the PFE encapsulates the data message to send the data message to an edge node at which the next hop SR executes, that information is also included in another flow tracing control message (or the same control message, for embodiments in which all of the operations are included in a single message from the PFE). Similarly, when an edge node performs logical routing for a particular SR, tunnels a data message to another site via an RTEP, performs logical switching, etc., this information is conveyed via flow tracing control messages.

Finally, the process 200 generates (at 225) a visualization of the flow tracing operation with a separate section for each identified site that indicates at least a subset of the LFEs that process the data messages at the site. This visualization is provided to the user within a graphical user interface (e.g., of a network management application). The process 200 then ends. In some embodiments, as shown in FIG. 1 , these sections are presented as vertical segments of the visualization display, with the section corresponding to the source site (i.e., the site at which the source endpoint is located) on one side and the section corresponding to the destination site (i.e., the site at which the destination endpoint is located) on the other side of the display. For instance, all of the examples shown in this application have the section corresponding to the source site on the left side of the display. If the destination site is different than the source site, these examples show the source site on the right side of the display.

FIG. 3 , however, shows a visualization 300 for a flow tracing operation in which data messages are sent from a first site to a second site and back to the first site in order to exit the network. In this case, the source endpoint is displayed in the bottom left (within a first section 305) and the visualization 300 shows the flow traveling from bottom to top, including left to right (to a second section 310) and then back to the left to the first section 305. As shown here, the source endpoint is a VM (“src_vm”) and the destination endpoint is an address (e.g., an IP address) external to the logical network. Similar to the flow shown in FIG. 1 , the PFE set at the host computer of the source VM performs processing for a logical switch (“paris_seg1”) for the data messages, which is shown in the visualization 300.

This logical switch logically forwards the data message flow to the T1 logical router “finance_t1”, for which Paris is a secondary site. As such, after the data message flow is tunneled to the edge device implementing the T1 SR within the Paris site, this secondary T1 SR routes the data message to the primary T1 SR (and sends this data message via the tunnel between logical network gateways for the backplane logical switch connecting these SRs), which is located in the New York site. As such, the visualization 300 shows the data message flow being sent via an RTEP to the primary T1, for which the GUI item is displayed in the New York section 310. Because the destination network address of these data messages is an external address, the T1 SR routes the data messages to the T0 SR (“corp_t0”), for which the New York site is a secondary site. This secondary T0 SR could be implemented on the same edge device as the primary T1 SR or a different edge device. In either case, the secondary T0 SR routes the data message flow to the primary T0 SR, which in this case is located at the Paris site.

Thus, rather than continuing to show the data message flow trending rightward, the visualization 300 displays the flow going back to the Paris section 305 (to a primary T0 SR, via the tunnel between logical network gateways for connecting these T0 SRs). Finally, within this section 305, the visualization displays a cloud icon to indicate that the data message flow is sent to external networks. While this example shows a single section 305 corresponding to the Paris site, other embodiments display multiple separate sections for a site through which the data message flow passes multiple times. In some such embodiments, a second section corresponding to the Paris site would be displayed to the right of the New York section 310, and would include the UI items for the primary T0 SR as well as the external network cloud icon. Some embodiments also use arrows to indicate the direction of the flow, which can be useful for such a flow.

As noted, some embodiments do not display all of the LFEs that process a data message flow in the visualization for that flow. In some embodiments, only user-configured LFEs are shown in the visualization. These include logical switches to which network endpoints connect, logical routers, and RTEPs. However, the internal logical components of a logical router are not shown in some embodiments (i.e., the DR and SR are not differentiated, and neither the logical switch that connects these components nor the backplane logical switch that connects SRs across sites are indicated except as an RTEP). As described above by reference to FIG. 1 , in some embodiments the visualization shows the connections between the components, includes the names of these LFEs (e.g., paris_seg1, finance_t1, etc.), and for logical routers that span multiple sites, indicates whether each site is the primary or secondary site for that logical router. Other embodiments might include additional information as well.

When a segment is stretched between sites and the source and destination endpoints are both connected to that segment, data messages are sent across sites via logical network gateways and the tunnel between these logical network gateways (the RTEP channel). Whereas the logical network gateways for a backplane logical switch between SRs of the same logical router are typically implemented as tunnel endpoints (RTEPs) on the same edge devices as the SRs, the logical network gateways for stretched logical switches are implemented as RTEPs on edge devices separate from the host computers that implement the logical switches for these data messages. Thus, rather than tunnel a data message directly from the source host computer at the source site to the destination host computer at the destination site, some such embodiments tunnel the data message from the source host computer to the edge device at the source site implementing the logical network gateway for the logical switch, which in turn tunnels the data message to the edge device at the destination site implementing the logical network gateway for the logical switch (via the RTEP channel between the sites), which then tunnels the data message to the destination host computer.

FIG. 4 illustrates a visualization 400 of a flow tracing operation for such a data message flow in which the source and destination endpoints connect to the same logical switch at different sites. As shown, two separate UI items are displayed for the logical switch (“stretched_seg1”), one in each of the sections 405 (for the Paris site) and 410 (for the New York site). In addition, an RTEP item is displayed on the boundary for the logical network gateways that process the data message. In other embodiments, because the logical network gateways are implemented on separate devices, two different items are shown on either side of the section boundary.

As described above, each LFE displayed in the flow tracing operation visualization is implemented by a PFE. While many of the LFEs are actually each implemented by many different PFEs at one or more sites, in the course of processing the data message flow each LFE as represented in the visualization is typically processed by one PFE (or possibly two PFEs). For example, a logical switch to which a source endpoint for a data message flow connects will typically be implemented (in the context of processing the data message flow) by the PFE set at the host computer on which the source endpoint executes. This PFE set also typically implements the logical router (i.e., the DR of the logical router) to which the logical switch connects, but if the data message flow is sent to an SR, then the edge device that implements the SR also implements the logical router.

In general, the logical network gateways and SRs are implemented on edge devices, while the logical switches shown in the flow tracing operation visualizations are implemented by the edge devices as well as the PFE sets executing on the source and/or destination endpoint host computers. In some embodiments, the visualization provides an option for a user to view the physical computing device (e.g., edge devices, host computers) that implements each of the displayed LFEs that processes the data message flow.

FIG. 5 illustrates a flow tracing operation visualization 500 with an option 520 selected to view the physical computing devices (e.g., edge devices, host computers that execute PFE sets) implementing each displayed LFE in the course of processing the flow tracing operation data messages. In some embodiments, this option is referred to as a “Fabric View” and is displayed along with the flow visualization as a selectable item that enables turning on or off the display of the physical devices. In other embodiments, the option may alternatively or conjunctively be selectable through a drop-down menu, keyboard shortcuts, etc., and may be referred to by other names.

As shown in the figure, some embodiments display the physical devices using a dashed line (as opposed to the straight lines used to represent the LFEs) or use a different mechanism to differentiate the physical devices from the LFEs (e.g., different colors, different fonts for the names, etc.). The display also shows the type of physical device (e.g., either host node or edge node) as well as the name for the physical device in the network management and control system (e.g., HN1, EN2, etc.). In addition, as shown in the figure, some embodiments display the name of the physical device within a selectable item (in this case, a dotted line circle). This selectable item (i.e., the items showing “HN1”, “EN1”, etc.) is selectable in some embodiments to view additional details about that physical device (e.g., the type of computing device, the group of computing devices to which it belongs, etc.). It should be understood that different embodiments display such selectable items for the physical devices in different manners (e.g., with or without the device name, as solid shaded shapes rather than dotted or dashed lines, etc.).

The visualization 500 shows a data message flow following the same path as in FIG. 1 , but with the physical device view option 520 selected. In the first section 505 (for the Paris site) the source VM and initial logical switch are shown as associated with a host computer HN1. For the logical routers, some embodiments display edge nodes for each SR that processes the data message flow (i.e., ignoring that the DRs are often implemented elsewhere, such as the host computer HN1). Thus, a first edge node EN1 is displayed for the secondary SR of the T1 logical router “finance_t1” within the Paris section 505 while a second edge node EN2 is shown for the primary SR of this logical router within the New York section 510.

In this case, the edge node EN2 also implements the secondary SR for the TO logical router “corp_t0” within the New York site. As in this example, some embodiments display two separate indicators for the same edge node when this occurs. However, other embodiments display a single indicator for an edge node when two consecutive SRs are implemented on the same edge node (e.g., an indicator that encompasses both of the SRs). Within the section 515 corresponding to the London site, two edge nodes EN3 and EN4 are displayed for the two SRs that process the packet at this site.

In addition, although the logical forwarding operations of the logical switch “london_seg1” are performed at the edge node EN4 for this data message flow, the UI item for the logical switch is displayed within the UI item for the destination host computer HN2 (the PFE set executing at this host computer does perform additional logical processing to deliver the data message to the destination endpoint “dst_vm”). Different embodiments, however, display the logical switch UI item within the indicator for the edge device EN4, as two separate UI items within both of the physical device indicators, or as a single item on the boundaries of these two physical device indicators. While the example shows the UI items representing the physical devices as encompassing the UI items representing the LFEs, other embodiments instead display indicators for the physical devices that abut the corresponding LFEs. It should be understood that many variations on the display shown in FIG. 5 are possible in different embodiments.

When a data message flow passes through numerous sites, the visualization of the flow tracing operation can become crowded and difficult to follow because so many sections are required. In addition, an administrator may not be interested in viewing all of the different secondary and primary SRs that process the data message flow. As such, some embodiments present the sections for intermediate sites (i.e., any site that is not the first or last site through which a data message flow passes) as collapsible within the user interface.

FIG. 6 illustrates a visualization 600 of a flow tracing operation for a data message flow that passes through five different sites and thus includes five sections 605-625. Here, the data message flow is sent from a source VM (“src_vm”) to a logical switch (“paris_seg1”) to a first T1 logical router (“finance_t1”) to a first TO logical router (“corp_t0”). The visualization for these LFEs spans three sections 605-615. The primary T0 SR located in the London site forwards the data message to an external network, through which the data message is delivered to a primary SR for another TO logical router (“web_t0”) at a Singapore site that is also managed by the network management and control system (which therefore receives flow tracing operation data from the edge device implementing the T0 SR). This external network is represented by a cloud icon displayed between the sections 615 and 620 corresponding to the London and Singapore sites, respectively. From here, the data message flow is sent to the Los Angeles site, where it is processed by a secondary T0 SR, the primary SR for a second T1 logical router (“web_t1”), a second logical switch (“losangeles_seg1”), and would be delivered to the destination VM (“dst_vm”) if the flow were not for a flow tracing operation.

In this example, each of these sections is fully displayed. However, when the number of sites is above a threshold (e.g., more than four sites, more than five sites, etc.), some embodiments automatically display some of the sections as collapsed while providing a selectable item for expanding the section. FIG. 7 illustrates a visualization 700 of a flow tracing operation for a data message flow that passes through five different sites (similar to that shown in FIG. 6 ) but with the intermediate sections collapsed. In this case, the sections 705 and 710 corresponding to the source (Paris) and destination (Los Angeles) sites are displayed in full with the LFE indicators shown. For the intermediate New York, London, and Singapore sites, however, the visualization 700 displays only expandable UI items 715-725 along with header text specifying the site locations. This significantly reduces the clutter displayed within the visualization 700.

In some embodiments, as shown, vertical dashed lines (or similar items used to separate sections of the visualization) are not displayed between expanded sections and collapsed sections. Rather, lines are used to indicate that the flow passes from one of the LFEs in the expanded section (e.g., the primary T1 SR in the Paris section 705) to the site represented by the collapsed section. In some embodiments, these lines are always shown in the middle of the visualization. In other embodiments, as with the line passing from the UI item 725 to the secondary T0 SR item in the Los Angeles section 710, these lines are displayed so as to connect to the previous/next LFE in the flow.

When a user selects one of the expandable UI items, the visualization is modified to display the corresponding section with the LFEs and their connections. FIG. 8 illustrates the flow tracing operation visualization 700 of FIG. 7 after one of the sections has been expanded. Specifically, a user has selected the expandable item 715 such that the section 800 is now displayed for the New York site, showing UI items for the primary SR of the T1 logical router “finance_t1” as well as the secondary SR of the TO logical router “vmw_t0”. In addition, the RTEP item for the logical network gateways connecting the two SRs of the T1 logical router is now shown on the boundary between the sections 705 and 800. Above the section 800 the visualization also displays a selectable UI item 805 that enables the user to collapse this section 800 (i.e., back into the expandable item 715). Other embodiments do not automatically collapse any of the sections in the initial display of the flow tracing operation, but instead provide selectable items that enable collapse of each section (or each intermediate section), such as the item 805.

The flow tracing operation visualizations of some embodiments can include additional details about the data message flow and the LFEs and physical elements that process the data message flow, either as part of the standard display or as options for a user to select. FIG. 9 illustrates a visualization 900 of a flow tracing operation in which a user has selected options for the visualization to include (i) information indicating the latency between datacenters and (ii) indications as to which LFEs currently have alarms raised. These appear in the user interface as selectable items 905 and 910, which are currently shown as selected.

The selectable item 905 causes the display of latency information between the sections of the visualization 900. The visualization 900, in this example, is the same flow tracing operation as that shown in FIG. 1 , with three sections 915-925 corresponding to three sites (Paris, New York, and London, respectively) through which the data message flow passes. The latency information shown in the visualization indicates the time that the data messages take to be sent from one site to another. In some embodiments, this information is based on timestamps in the control messages sent by the PFEs implementing the logical network gateways in each site that send the data messages between the sites. Here, the information shows latency of 1.8 ms between Paris and New York and 2.6 ms between New York and London. When the flow tracing operation involves sending multiple data messages, the latency information may be a mean latency, median latency, maximum latency, or other measurement.

The selectable item 910 causes the display of alarm information in the visualization 900. In this case, two LFEs have alarms raised: the secondary SR at the New York site for the TO logical router “corp_t0” and the primary SR at the London site for the T1 logical router “it_t1”. The UI items 930 and 935 for these LFEs are displayed in a different manner than the other LFEs so as to highlight that alarms are raised for the LFEs. In different embodiments, the presence of raised alarms may be shown by bolding or otherwise changing the font used within the UI items, changing the color, saturation, or other display characteristics of the UI items, making the UI items larger, etc. Alarms, in some embodiments, are events that the network management and control system detects and raises to indicate that the performance and system operation of a component (e.g., an LFE, PFE, etc.) is potentially affected. For instance, an edge device could be experiencing unusually high CPU usage or low available disk space, which could raise alarms on the physical edge device as well as the SRs implemented by that edge device. In some embodiments, the UI items indicating alarms raised are also selectable to provide information about the number of alarms raised, the types of alarms raised, etc.

FIG. 10 conceptually illustrates an electronic system 1000 with which some embodiments of the invention are implemented. The electronic system 1000 may be a computer (e.g., a desktop computer, personal computer, tablet computer, server computer, mainframe, a blade computer etc.), phone, PDA, or any other sort of electronic device. Such an electronic system includes various types of computer readable media and interfaces for various other types of computer readable media. Electronic system 1000 includes a bus 1005, processing unit(s) 1010, a system memory 1025, a read-only memory 1030, a permanent storage device 1035, input devices 1040, and output devices 1045.

The bus 1005 collectively represents all system, peripheral, and chipset buses that communicatively connect the numerous internal devices of the electronic system 1000. For instance, the bus 1005 communicatively connects the processing unit(s) 1010 with the read-only memory 1030, the system memory 1025, and the permanent storage device 1035.

From these various memory units, the processing unit(s) 1010 retrieve instructions to execute and data to process in order to execute the processes of the invention. The processing unit(s) may be a single processor or a multi-core processor in different embodiments.

The read-only-memory (ROM) 1030 stores static data and instructions that are needed by the processing unit(s) 1010 and other modules of the electronic system. The permanent storage device 1035, on the other hand, is a read-and-write memory device. This device is a non-volatile memory unit that stores instructions and data even when the electronic system 1000 is off. Some embodiments of the invention use a mass-storage device (such as a magnetic or optical disk and its corresponding disk drive) as the permanent storage device 1035.

Other embodiments use a removable storage device (such as a floppy disk, flash drive, etc.) as the permanent storage device. Like the permanent storage device 1035, the system memory 1025 is a read-and-write memory device. However, unlike storage device 1035, the system memory is a volatile read-and-write memory, such a random-access memory. The system memory stores some of the instructions and data that the processor needs at runtime. In some embodiments, the invention's processes are stored in the system memory 1025, the permanent storage device 1035, and/or the read-only memory 1030. From these various memory units, the processing unit(s) 1010 retrieve instructions to execute and data to process in order to execute the processes of some embodiments.

The bus 1005 also connects to the input and output devices 1040 and 1045. The input devices enable the user to communicate information and select commands to the electronic system. The input devices 1040 include alphanumeric keyboards and pointing devices (also called “cursor control devices”). The output devices 1045 display images generated by the electronic system. The output devices include printers and display devices, such as cathode ray tubes (CRT) or liquid crystal displays (LCD). Some embodiments include devices such as a touchscreen that function as both input and output devices.

Finally, as shown in FIG. 10 , bus 1005 also couples electronic system 1000 to a network 1065 through a network adapter (not shown). In this manner, the computer can be a part of a network of computers (such as a local area network (“LAN”), a wide area network (“WAN”), or an Intranet, or a network of networks, such as the Internet. Any or all components of electronic system 1000 may be used in conjunction with the invention.

Some embodiments include electronic components, such as microprocessors, storage and memory that store computer program instructions in a machine-readable or computer-readable medium (alternatively referred to as computer-readable storage media, machine-readable media, or machine-readable storage media). Some examples of such computer-readable media include RAM, ROM, read-only compact discs (CD-ROM), recordable compact discs (CD-R), rewritable compact discs (CD-RW), read-only digital versatile discs (e.g., DVD-ROM, dual-layer DVD-ROM), a variety of recordable/rewritable DVDs (e.g., DVD-RAM, DVD-RW, DVD+RW, etc.), flash memory (e.g., SD cards, mini-SD cards, micro-SD cards, etc.), magnetic and/or solid state hard drives, read-only and recordable Blu-Ray® discs, ultra-density optical discs, any other optical or magnetic media, and floppy disks. The computer-readable media may store a computer program that is executable by at least one processing unit and includes sets of instructions for performing various operations. Examples of computer programs or computer code include machine code, such as is produced by a compiler, and files including higher-level code that are executed by a computer, an electronic component, or a microprocessor using an interpreter.

While the above discussion primarily refers to microprocessor or multi-core processors that execute software, some embodiments are performed by one or more integrated circuits, such as application specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs). In some embodiments, such integrated circuits execute instructions that are stored on the circuit itself.

As used in this specification, the terms “computer”, “server”, “processor”, and “memory” all refer to electronic or other technological devices. These terms exclude people or groups of people. For the purposes of the specification, the terms display or displaying means displaying on an electronic device. As used in this specification, the terms “computer readable medium,” “computer readable media,” and “machine readable medium” are entirely restricted to tangible, physical objects that store information in a form that is readable by a computer. These terms exclude any wireless signals, wired download signals, and any other ephemeral signals.

This specification refers throughout to computational and network environments that include virtual machines (VMs). However, virtual machines are merely one example of data compute nodes (DCNs) or data compute end nodes, also referred to as addressable nodes. DCNs may include non-virtualized physical hosts, virtual machines, containers that run on top of a host operating system without the need for a hypervisor or separate operating system, and hypervisor kernel network interface modules.

VMs, in some embodiments, operate with their own guest operating systems on a host using resources of the host virtualized by virtualization software (e.g., a hypervisor, virtual machine monitor, etc.). The tenant (i.e., the owner of the VM) can choose which applications to operate on top of the guest operating system. Some containers, on the other hand, are constructs that run on top of a host operating system without the need for a hypervisor or separate guest operating system. In some embodiments, the host operating system uses name spaces to isolate the containers from each other and therefore provides operating-system level segregation of the different groups of applications that operate within different containers. This segregation is akin to the VM segregation that is offered in hypervisor-virtualized environments that virtualize system hardware, and thus can be viewed as a form of virtualization that isolates different groups of applications that operate in different containers. Such containers are more lightweight than VMs.

Hypervisor kernel network interface modules, in some embodiments, is a non-VM DCN that includes a network stack with a hypervisor kernel network interface and receive/transmit threads. One example of a hypervisor kernel network interface module is the vmknic module that is part of the ESXi™ hypervisor of VMware, Inc.

It should be understood that while the specification refers to VMs, the examples given could be any type of DCNs, including physical hosts, VMs, non-VM containers, and hypervisor kernel network interface modules. In fact, the example networks could include combinations of different types of DCNs in some embodiments.

While the invention has been described with reference to numerous specific details, one of ordinary skill in the art will recognize that the invention can be embodied in other specific forms without departing from the spirit of the invention. In addition, a number of the figures (including FIG. 2 ) conceptually illustrate processes. The specific operations of these processes may not be performed in the exact order shown and described. The specific operations may not be performed in one continuous series of operations, and different specific operations may be performed in different embodiments. Furthermore, the process could be implemented using several sub-processes, or as part of a larger macro process. Thus, one of ordinary skill in the art would understand that the invention is not to be limited by the foregoing illustrative details, but rather is to be defined by the appended claims. 

1-25. (canceled)
 26. A method for presenting a visualization of a data message flow within a logical network that is implemented across a plurality of sites including first and second sites, the method comprising: receiving flow tracing data regarding the data message flow from a first endpoint in the first site to a second endpoint in the second site, the data message flow processed according to a group of one or more logical forwarding elements (LFEs) implemented in the first and second sites; for each of the first and second sites through which the data message flow passes, identifying a set of one or more LFEs that processes the data message flow in the site; and presenting, for the data message flow, a visualization comprising a separate section for each site through which the data message flow passes, each section indicating a subset of one or more LFEs that processes the data message flow in the corresponding site for the section.
 27. The method of claim 26, wherein receiving flow tracing data comprises receiving flow tracing data from a plurality of physical forwarding elements that implement the set of LFEs that process the data message flow at each site through which the data message flow passes.
 28. The method of claim 26, wherein each set of LFEs comprises a set of one or more logical routers or a set of one or more logical switches.
 29. The method of claim 28, wherein: a particular logical switch is implemented across at least two of the set of sites through which the data message flow passes; and when the source endpoint and the second endpoint are both connected to the particular logical switch, the visualization includes a first GUI item for the particular logical switch in the first site, a second GUI item for a set of remote tunnel endpoints (RTEPs) used to forward the data message between the first and second sites, and a third GUI item for the particular logical switch in the second site.
 30. The method of claim 26, wherein the visualization comprises an indicator of latency between the first site and the second site, the indicator displayed between a first section for the first site and a second section for the second site.
 31. The method of claim 26, wherein the visualization identifies a set of LFEs with one or more alarms triggered by displaying GUI items for the set of LFEs using an indicator.
 32. The method of claim 26, wherein the visualization indicates a direction of the data message flow.
 33. The method of claim 26, wherein the separate sections in the visualization are displayed in an order starting with a first section for the first site and ending with a second section for the second site, with sections for each site through which the data message flow passes between the first and second sites displayed in order between the first and second sections.
 34. The method of claim 33, wherein the sections for each site through which the data message flow passes between the first and second sites are presented as collapsible within the visualization.
 35. The method of claim 34, wherein: within a particular section for a particular site, each LFE indicated as processing the data message flow within the particular site is presented as a separate GUI item; and when the particular section is collapsed, the GUI items for the LFEs that process the data message flow at the particular site are not displayed.
 36. The method of claim 33, wherein when a number of sites through which the data message flow passes between the first and second sites is above a threshold, the sections for the sites through which the data message flow passes between the first and second sites are automatically presented as collapsed within the visualization.
 37. The method of claim 34, wherein each section that is collapsed within the visualization is expandable, wherein expanding a particular section corresponding to a particular site causes the visualization to display GUI items for at least a subset of the LFEs that process the data message flow in the particular site.
 38. A non-transitory machine-readable medium storing a program which when executed by at least one processing unit presents a visualization of a data message flow within a logical network that is implemented across first and second sites, the program comprising sets of instructions for: receiving flow tracing data regarding the data message flow from a first endpoint in the first site to a second endpoint in the second site, the data message flow processed according to a group of one or more logical forwarding elements (LFEs) implemented in the first and second sites; for each of the first and second sites through which the data message flow passes, identifying a set of one or more LFEs that processes the data message flow in the site; and presenting, for the data message flow, a visualization comprising a separate section for each site through which the data message flow passes, each section indicating a subset of one or more LFEs that processes the data message flow in the corresponding site for the section.
 39. The non-transitory machine-readable medium of claim 38, wherein the set of instructions for receiving flow tracing data comprises a set of instructions for receiving flow tracing data from a plurality of physical forwarding elements that implement the set of LFEs that process the data message flow at each site through which the data message flow passes.
 40. The non-transitory machine-readable medium of claim 38, wherein each set of LFEs comprises a set of one or more logical routers or a set of one or more logical switches.
 41. The non-transitory machine-readable medium of claim 40, wherein: a particular logical switch is implemented across at least two of the set of sites through which the data message flow passes; and when the source endpoint and the second endpoint are both connected to the particular logical switch, the visualization includes a first GUI item for the particular logical switch in the first site, a second GUI item for a set of remote tunnel endpoints (RTEPs) used to forward the data message between the first and second sites, and a third GUI item for the particular logical switch in the second site.
 42. The non-transitory machine-readable medium of claim 38, wherein the visualization comprises an indicator of latency between the first site and the second site, the indicator displayed between a first section for the first site and a second section for the second site.
 43. The non-transitory machine-readable medium of claim 38, wherein the visualization identifies a set of LFEs with one or more alarms triggered by displaying GUI items for the set of LFEs using an indicator.
 44. The non-transitory machine-readable medium of claim 38, wherein the visualization indicates a direction of the data message flow.
 45. The non-transitory machine-readable medium of claim 38, wherein the separate sections in the visualization are displayed in an order starting with a first section for the first site and ending with a second section for the second site, with sections for each site through which the data message flow passes between the first and second sites displayed in order between the first and second sections. 